CDS

Accession Number TCMCG019C11832
gbkey CDS
Protein Id XP_022940434.1
Location complement(join(5067299..5067358,5068453..5068619,5069616..5069721,5070623..5070660,5070913..5071000,5071646..5071699,5071785..5071893,5072025..5072107,5072215..5072277,5072391..5072453,5073587..5073685,5073942..5073980,5074072..5074127,5074427..5074541,5074656..5074825,5075214..5075317,5075469..5075563,5075901..5075983,5076064..5076244))
Gene LOC111446044
GeneID 111446044
Organism Cucurbita moschata
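
The Location field above can be cross-checked against the protein length reported further down. The following is a minimal sketch, not part of the original record: `parse_location` is a hypothetical helper name, and the regex only handles the simple `complement(join(a..b,...))` form used here, not the full INSDC location syntax (partial markers such as `<` and `>`, or remote references). It counts the exon segments and sums their lengths.

```python
import re

# Location string copied verbatim from the record's Location field.
LOCATION = (
    "complement(join(5067299..5067358,5068453..5068619,5069616..5069721,"
    "5070623..5070660,5070913..5071000,5071646..5071699,5071785..5071893,"
    "5072025..5072107,5072215..5072277,5072391..5072453,5073587..5073685,"
    "5073942..5073980,5074072..5074127,5074427..5074541,5074656..5074825,"
    "5075214..5075317,5075469..5075563,5075901..5075983,5076064..5076244))"
)


def parse_location(location):
    """Return (strand, segments) for a simple join/complement location.

    Segments are (start, end) tuples with 1-based inclusive coordinates.
    Hypothetical helper: it ignores partial markers and remote references.
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


strand, segments = parse_location(LOCATION)
# Inclusive coordinates, so each segment spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)

print(strand)               # -
print(len(segments))        # 19
print(cds_length)           # 1773
print(cds_length // 3 - 1)  # 590 (codons minus the stop codon)
```

The summed segment length of 1773 nt corresponds to 591 codons, that is 590 amino acids plus one stop codon, which agrees with the Length field of the Protein section.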

Protein

Length 590aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023084666.1
Definition imidazole glycerol phosphate synthase hisHF, chloroplastic-like [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category E
Description Belongs to the HisA/HisF family
KEGG_TC -
KEGG_Module M00026
KEGG_Reaction R04558
KEGG_rclass RC00010, RC01190, RC01943
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K01663
EC -
KEGG_Pathway ko00340, ko01100, ko01110, ko01230, map00340, map01100, map01110, map01230
GOs -

Sequence

CDS:  
ATGGAAGCAACGCCGTTCTCTATCGCTGTTTCTTCTTCTCGGACTGTAATTCGATCATTTCCGTCGTCGTTTCATACCAGCTCTCTCGTTTTTCTTCGCAATAATCGTTACAAAACTTGTCATCTCAAAGTTAAGTCCACCGGTAAGTTTGCGGTTCGTGCCTCATTTTCGGGTGACTCAGTCGTGACTTTGCTGGATTACGGTGCTGGTAATGTTCGTAGTGTGAGGAATGCAATTCGTTACCTTGGCTTCGACATTAAAGATGTGCAAACTCCAGAGGACATTCTGAATGCAAAACGCCTAATATTTCCTGGAGTTGGGGCATTTGCTCCAGCCATGGATGTGCTAAACAATAAAGGAATGGCTGAAGCACTCTGTACTTATATTGAGAATGATCGCCCATTTTTAGGCATTTGTCTTGGGCTTCAATTACTCTTCGAATCAAGTGACGAGAATGGACCAGTAAAAGGACTTGGCTTAATACCGGGTGTGGTTGGGCGTTTCGACTCTTCCAATGGTTTTAGTGTACCCCATATTGGGTGGAATGCTTTGGAAATCTCAGATGACTCTGAGATCTTGGATGATATTTGTAATCGTCATGTCTACTTTGTTCACTCTTACCGTGCTATGCCATCTGACAAGAACAAGGAGTGGATCTCTTCTACTTGCAGCTATGGTGACAGGTTTATAGCCTCAGTTAGAAGGGGAAATGTCCATGCAGTTCAATTCCACCCAGAAAAGAGTGGAGATGTAGGCCTGTCTGTCCTCAGAAGATTCTTGCTTCCAAAGTCAACTTTGACCAAGAAACCTACTGAGGGGAAGGCATCAAGGCTTGCAAAAAGGGTAATTGCTTGTCTTGATGTGCGGACAAATGATCAAGGGGATCTTGTTGTTACCAAAGGGGACCAATATGACGTAAGGGAGCAATCAGAAGAGAATGAGGTGAGGAACCTTGGCAAACCGGTTGAGCTTGCTGGACAGTACTACAAGGATGGAGCTGACGAGGTCAGTTTTTTGAATATAACTGGTTTCCGTGACTTCCCTCTTGGCGACTTGCCAATGTTGCAGGTGCTGCGATACACATCAGAAAATGTTTTTGTACCATTGACTGTTGGTGGCGGAATTAGAGATTTTACGGATGCAAATGGCAGACACTATTCTAGCTTGGAAGTTGCTTCAGAATATTTCAGATCTGGAGCTGATAAAATATCTATCGGAAGTGATGCAGTTTATGCTGCTGAGGAATATTTAAGAACTGGTGTAAAGACTGGAAAGAGCAGCTTGGAACAGATTTCTACTGTTTATGGAAATCAGGCTGTCGTGATAAGTATTGATCCTCGAAGAGTGTACCTTAAAAGTCCTGATGATGTGGAGTTCAAAGTCATTCGAGTTACTAACCCAGGTCCTAATGGAGAAGAATATGCATGGTATCAGTGTACAGTGAATGGAGGTCGAGAAGGTCGACCTATTGGAGCTTATGAGCTTGCAAAAGCAGTTGAGGAGCTTGGAGCTGGAGAAATACTGCTAAATTGCATAGATTGTGATGGTCAAGGAAAAGGATTTGATTTAGATCTAGTAAAGCTGATATCGGATTCTGTGAGCATCCCTGTTATTGCCAGCAGCGGTGCTGGGTGTTCTGACCATTTCTCAGAGGTGTTCAACAAGACAAATGCATCTGCTGCCTTAGCTGCTGGCATTTTCCATCGCAAGGAGGTGGCTATTCAGTCCGTAAAAGGGCATTTATTAAAGGAAGGCATAGAGGTCAGAATGTAA
Protein:  
MEATPFSIAVSSSRTVIRSFPSSFHTSSLVFLRNNRYKTCHLKVKSTGKFAVRASFSGDSVVTLLDYGAGNVRSVRNAIRYLGFDIKDVQTPEDILNAKRLIFPGVGAFAPAMDVLNNKGMAEALCTYIENDRPFLGICLGLQLLFESSDENGPVKGLGLIPGVVGRFDSSNGFSVPHIGWNALEISDDSEILDDICNRHVYFVHSYRAMPSDKNKEWISSTCSYGDRFIASVRRGNVHAVQFHPEKSGDVGLSVLRRFLLPKSTLTKKPTEGKASRLAKRVIACLDVRTNDQGDLVVTKGDQYDVREQSEENEVRNLGKPVELAGQYYKDGADEVSFLNITGFRDFPLGDLPMLQVLRYTSENVFVPLTVGGGIRDFTDANGRHYSSLEVASEYFRSGADKISIGSDAVYAAEEYLRTGVKTGKSSLEQISTVYGNQAVVISIDPRRVYLKSPDDVEFKVIRVTNPGPNGEEYAWYQCTVNGGREGRPIGAYELAKAVEELGAGEILLNCIDCDGQGKGFDLDLVKLISDSVSIPVIASSGAGCSDHFSEVFNKTNASAALAAGIFHRKEVAIQSVKGHLLKEGIEVRM
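
The relationship between the CDS and Protein entries above can be illustrated with a small translation routine. This is a sketch under stated assumptions: it uses the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and `translate` is an illustrative function name rather than anything taken from the source database. Only the first and last few codons of the CDS are used as inputs.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), an assumption here.
# Codons are enumerated in T, C, A, G order at each position, matching
# the order of the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS shown above.
print(translate("ATGGAAGCAACGCCG"))  # MEATP
# Last four codons of the CDS, ending in the TAA stop codon.
print(translate("GTCAGAATGTAA"))     # VRM
```

Applied to the full CDS string, the same routine would be expected to reproduce the Protein entry, since the first and last residues already match (`MEATP...` and `...VRM`).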